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Abstract. We study the notion of reverse hypercontractivity. We show that reverse hy- 
percontractive inequalities are implied by standard hypercontractive inequalities as well as 
by the modified log-Sobolev inequality. Our proof is based on a new comparison lemma for 
Dirichlet forms and an extension of the Stroock-Varopoulos inequality. 

A consequence of our analysis is that all simple operators L — Id — E as well as their 
tensors satisfy uniform reverse hypercontractive inequalities. That is, for all q < p < 1 and 
every positive valued function / for t > log we have ||e _<i /|| 9 > ||/|| p . This should 
be contrasted with the case of hypercontractive inequalities for simple operators where t is 
known to depend not only on p and q but also on the underlying space. 

The new reverse hypercontractive inequalities established here imply new mixing and 
isoperimetric results for short random walks in product spaces, for certain card-shufflings, for 
Glauber dynamics in high-temperatures spin systems as well as for queueing processes. The 
inequalities further imply a quantitative Arrow impossibility theorem for general product 
distributions and inverse polynomial bounds in the number of players for the non-interactive 
correlation distillation problem with m-sided dice. 
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1. Introduction 

1.1. Background. Log-Sobolev and hypercontractive inequalities play a fundamental role in 
a number of areas in analysis and probability theory including the study of Gaussian processes 
(see, e.g., |Gr o78l [Jan97| ). analysis of Markov chains (see, e.g., |SC97| ) and discrete Fourier 
analysis starting in |KKL88| iTaM] . 

One of the first and most useful hypercontractive inequalities due to Bonami-Nelson-Gross- 
Beckner |Bon70l INel73l IGro75l IBec75| states that if (fi,/x) = ({0, 1}, |(tf + <fi)) then the 
operator T = e - t ( Id ~ E ^) satisfies 

(1.1) \\Ttf\\ P < \\f\\ q , Vf:n^R,p>q>l,t>llog P ~ l 



2 q-1 

In probabilistic language, the operators (Tt)t>o form a Markov semigroup with Tf(x) = 
K[f(Xt)\Xo = x], where (Xt)t>o is a continuous-time Markov chain on Cl where the particle 
jumps from the state y to the state z with probability /j,(z) and the gaps between successive 
jumps are distributed as independent exponential random variables. 

The strength of a simple hypercontractive inequality like (jl.ip lies in the fact that it ten- 
sorizes. This led to many applications in discrete Fourier analysis (starting with [KKL88J) 
and even earlier in the study of Gaussian processes. Extending (jl.ip to other spaces turned 
out to be a non-trivial task. For the case of the spaces (Cl, fi) = ({0, 1}, a5o + (1 — a)5i), with 
a < 1/2, the first bounds were established by Talagrand [Tal94j. Exact formulas have been 
obtained by Oleszkiewicz [Ole03| in the cases where either p>2 = qorp = 2>q>l. Wolff 
then extended these results |Wol07| to general discrete spaces and, in a slightly less precise 
form, to all p > q > 2 and all 2 > p > q > 1: let (f2,//) be a finite probability space with 
a = min^gQ /^{w} > 0; then there exists some universal positive constant e such that for p, q 
as above and certain to = to(p,q,a), given by an explicit though complicated formula, 

t > t => V/ : Cl -> M, ||T t /|| p < ll/H, t > t - e. 

Moreover, lim Q _ s>0 + toiPi 1^ a ) = °°. This dependency on the smallest atom in space is present 
in many applications of hypercontractivity starting with |Tal94j . We note further that the 
same dependency is arrived at using the exact calculation of the log-Sobolev constant of simple 
operators (that is, operators of the form Tt = e _i ( /rf - E M) acting on the function defined on a 
probability space (CI, /j,)) by Diaconis and Saloff-Coste [DSC96]. 

A 'reverse' hypercontractivity is shortly proved and discussed in a paper by Borell |Bor82 
in the 80's. This result, proven for the measure (CI, /j,) = ({0, 1}, ^(^o + ^i)), states that 

(1.2) \\T t f\\ q >\\f\\ p , Vf:Cl^R + , l>p>q, t > 1 log 1 ~ " 



2 1-p 

This inequality which also tensorizes is indeed 'reverse' in many ways. Not only the inequality 
goes 'the other way' and the roles of p and q get reversed, it is also the case that p and q are 
less than 1 (indeed they may be negative(l); note, however, that the function / has to take 
positive values). 

As far as we know, Borell's result was first used in a paper published more than 20 years 
later |MQR + 06 , where it is used to analyze mixing of short random walks on the discrete 



cube {0, l} n as well as to provide tight bounds on the Non-Interactive Correlation Distillation 
(NICD) problem. 



Motivated by generalization of applications in MOR + 06] as well as by other applications 
that will be discussed later, we wish to extend Borell's results to other discrete probability 
spaces. Noting the similarity of the inequalities (II. ip and (II. 2p it is tempting to conjecture 
(as the first named author have done) that the formulas for hypercontracitivity and reverse 
hypercontractivity are 'the same': in particular, for discrete spaces there is a dependency on 
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the size of the smallest atom in space as in the above-mentioned results for hypercontractivity. 
The conjecture is further supported by the fact that for diffusions both hypercontractivity 
and reverse hypercontractivity are equivalent to the standard log-Sobolev inequality (for more 
details see [BGM94] ; some pioneering results relating hypercontractivity to reverse hypercon- 
tractivity were obtained already in [BJ| ). 

The conjecture turns out to be far from true. In fact our results show that for every discrete 
probability space (O,//): 

(1.3) \\T t f\\ q >\\f\\ p , Vf:fl^R + , l>p>q, t > log ^— 1 

1 — p 

In particular, reverse hypercontr active inequalities hold uniformly for all probability spaces. 

It is well known that hypercontractive inequalities are intimately related to logarithmic 
Sobolev inequalities and our proof of (|1.3p is based on extension of this connection to 'norms' 
p < 1 (such extensions were noted before, see, e.g., Bakry's lecture notes |BGM94j ). At 
the heart of the proof is a new monotonicity result showing that under the appropriate nor- 
malization log-Sobolev inequalities are monotone in the norm parameter p for all p £ [0,2]. 
This result in turn is based on an extension of the Stroock-Varopoulos inequality to general 
norms. The result allows us to show how reverse hypercontractive inequalities follow directly 
from standard hypercontractive inequalities and furthermore from standard log-Sobolev and 
modified log-Sobolev inequalities. 

After we develop the theory of reverse hypercontractive inequalities, we derive a number of 
novel results regarding mixing of Markov chains run for short time starting from large sets, 
in general cubes, the symmetric group and Ising configurations (via Glauber dynamics). We 
further derive a quantitative Arrow's Theorem for general distributions and inverse polynomial 
bounds for the NICD problem for general m-sided dice. We proceed with formal definitions 
and statements of the main results. 

1.2. General setup. We now turn to the general mathematical setup of the paper. Let 
(O,//) be a finite probability space (with a natural a- field of all subsets of O). We assume 
//{w} > for weS!. Let E denote the expectation operator: E/ = J n / djj,. We also use the 
standard notation for the variance of /, Var(/) = E/ 2 — (E/) 2 , and the entropy of / > 0, 
Ent(f) = E(/log/) — E/ • logE/. Let H be the space of real- valued functions on Q. Let 
%(0 )O o) denote the strictly positive functions. 

Definition 1.1. Let L : % — > rl be a linear operator such that 

• LI = and 

• L is self-adjoint with respect to the L 2 (Q,[i) structure, i.e., KfLg = EgLf for any 
f,geH, and 

• L is positive semidefinite, i.e., E/L/ > for all / E T-L. 

The Markov semigroup of operators (Tt)t>o '■ H — > ti generated by L is given by 

Ttf = e~ tL f, 

with To/ = / and 4?Ttf = —LT t f = —T t Lf. The Dirichlet form £ : % x % — > R associated 
with L is given by 

£(f, g) = E(fLg) = E(gLf) = £(g, /) = -^E/T^. 

Recall that in the setup of the definition above: Tjl = 1 for t > and £(f,l) = for 
/ G H. The operators Ti are symmetric linear contractions in L p -norm for every p € [l,oo) 
and t > 0. They are mean-preserving, i.e., 



ET t f = ElT t f = E/T t l = E/, 
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and positivity preserving, i.e., Ttf > for / E % and / > 0, thus also order preserving (/ > g 
implies Ttf > Ttg). In fact, they preserve also strict positivity: / > implies Ttf > for 
t > 0. The positivity preserving property implies that |/|) < £{f, f) for any f £ H. 

Definition 1.2. For f € H and p > we denote by [|/[| p the p-th norm of /: (IE| / 1 73 ) 1 ^ - 
We extend the definition top£l and / G T-L(q,oo) by setting = (E/ p ) 1//p for p ^ 0, and 
| = exp(Elog/). 



Recall that || • || p is a true norm for p > 1 but it is only a pseudo-norm (triangle inequality 
fails) for p < 1 (unless |Q| = 1). It is an easy and well-known fact that for any / £ %(o,oo) 
the map p i— > ||/|L is continuous and non-decreasing. 

Following Borell |Bor82] we extend the definition of duality to p £ R: 

Definition 1.3. For a number p£R\ {0, 1} we define its (Holder) conjugate p' = p/{p — 1), 
so that i + = 1. We also set 0' = 0. 

Note that the map p \— > p' is a continuous order-reversing involution on (— oo, 1) and (1, oo) 
with fixed points and 2. It is worth observing that (2 — p)' = 2 — p' for p ^ 1, even though 
we will not make use of this fact. 

1.3. Log-Sobolev inequalities. We now recall the definition of log-Sobolev inequalities. 
Definition 1.4. For \ {0, 1} we say that p-logSob is satisfied with constant C > if 

C-r? 

(1-4) Ent(n< 4^3i) f (/ P_1 '/) 

for every / € %(o,oo)- We will say that 1-logSob is satisfied with constant C > if 

Ent(f)< jS(f, log f) 
for / G %(o,oo)- Finally, we will say that 0-logSob is satisfied with constant C > if 

Var(log/)<-^(/,l//) 

for every / € %( ,oo)- 

Remark 1.5. Obviously, the cases p = and p = 1 of the p-logSob inequality are limit cases 
of the p-logSob for \ {0, 1} (with the same C). 

Remark 1.6. Logarithmic Sobolev inequalities were introduced by Gross in his seminal paper 
|Gro75| . Gross defined logarithmic Sobolev inequality for p > 1. The definition was later 
extended by Bakry [BG M94] to any real p (including 1-logSob inequality). However, Gross's 
definition is more general than (11 .41) as it involves an extra term of the form 7E[/ P ] on the right 
hand side of the inequality, where 7 is some nonnegative constant. This is sometimes called 
defective logarithmic Sobolev inequality and has been studied extensively in the literature. 
Gross's definition reduces to (jl.4p when 7 = and p > 1. Finally, we remark that 1- 
logSob inequality is also known in the literature as modified log-Sobolev inequality (see, 
e.g., |Wu001 [GQM| IGoe04l IBT06] '! . The 1-logSob inequality is also called "entropic inequality" 



as it implies the exponential decay of entropy along the semigroup. 

Remark 1.7. Our definition uses a novel and non-standard normalization factor 4 ^'_ 1 ) m 
p.4p . The choice of this normalization makes our p-logSob constants invariant under Holder 
conjugation (see Lemma [3. 2|) . Moreover, this normalization is crucial to prove the main result 
of the paper - the monotonicity of the inequality for p £ [0,2] (see Theorem ll.8p . 
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In our main result we prove a general result relating p-logSob inequalities for different 
values of p. 

Theorem 1.8. Let < q < p < 2. Assume that p-logSob holds with a constant C > 0. Then 
also q-logSob holds true with the same constant C . 

Remark 1.9. It has been proved in [BGM94, Proposition 3.1] that if 2-logSob holds with 
constant C, then any p-logSob also holds with the same constant and the converse is true in 
case of diffusions with invariant measure \x. 

Using the fact that simple operators satisfy 1-logSob with the constant 4 (proved in [BT06J ; 
the proof is reproduced in our Lemma 13.51 below) we obtain the following corollary: 

Corollary 1.10. Assume that the semigroup {Tt)t>o * s generated by L = Id — E or by a tensor 
of simple operators. Then it satisfies the r-logSob inequality with constant 4 for all r G [0, 1]. 

1.4. Reverse hypercontractive estimates. Using Theorem 11.81 we derive the following 
general reverse hypercontractive bounds: 

Theorem 1.11. If a symmetric Markov semigroup (Tt)t>o satisfies r-logSob with constant 
C and r > 1 then for all q < p < 1 and every f G %(o,oo) f or all t > ^ log we have 
\\Ttf\\ q > ll/llp. 

Using Corollary 11.101 this implies in turn that: 

Corollary 1.12. If a symmetric Markov semigroup {Tt)t>o has a simple generator L = Id — E 
or it is a tensor product of such simple semigroups, then for allq < p < 1 and every f G W(o,oo) 



for t > log j^L we have \\T t f\\ q > 



p- 



In fact, for simple operators we derive the following stronger result: 

Theorem 1.13. Assume that a symmetric Markov semigroup (Tt)t>o has a simple generator 
L = Id — E or it is a tensor product of such simple semigroups. Let f £% be strictly positive. 
Then for all q < p < and t > log jE^j an d a ^ so f or all < q < p < 1 and t > log (i_p)(2-<?) 
we have \\T t f\\ q > \\f\\ p . 



Remark 1.14. The reverse hypercontractive inequality (|1 .2j) for a simple operator and the 
space { — 1, 1} with the uniform measure was derived prior to ours by Borell. His result is 
tight. 

1.5. Application 1: Mixing of large sets in Markov chains. Our first application of 
the new inequality is to mixing of Markov chains from large sets. The statement and proof of 
the theorem below are a generalization of the main result of |MOR+06| where it was proven 
for the random walk on the discrete cube {0, l} n . 

Theorem 1.15. Let (Xt)t>o be a continuous-time Markov chain on a finite state space £1 
which is reversible with respect to the invariant probability measure it. Let (Tt)t>o be the 
semigroup defined by Tff(x) = E, x f(Xf) for f € %. Assume (Tf)t>o satisfies 1-logSob with 
constant C. Let a, b > and let A,BCQ with tt{A} = exp(— a 2 /2) and tt{B} = exp(— b 2 /2). 
Let Xq be distributed according to it. Then 



(1.5) F{X G A, X t G B} > exp 



la 2 + 2e- 2t / c ab + b 2 
2 1 - e" 4 */ c 



This theorem should be compared to the two main techniques for proving lower bounds on 

F{X eA,X t eB}. 
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• First, the Expander Mixing Lemma (see, e.g., A SON. Chapter 9]) implies that if 
Poincare inequality holds with constant D then: 

(1.6) ¥{X EA,X t EB}> tt{A}tt{B} - y/7r{A}ir{B}e- t/D . 

The inequality (jl.6p will be better (up to constants) than our inequality (jl.5p in the 
case where the sets A and B are large, say tt{A}tt(B) > 2y / 7r{A}7r{i?}e _ */ £) , since 
in this case we obtain the lower bound of ^tt{A}tt{B} which is (except for the factor 
2) the best that one can hope for. However, in the case where the sets A and B 
are small, say tt{A}tt{B} < ^Jir{A}-K{B}e~ t / D , the expander mixing lemma gives 
nothing while (|1.5p gives a lower bound that is a power of the measures of the original 
sets. 

• The second technique uses total variation mixing times. Indeed, if the worst total 
variation distance at time t is at most e, then we have: 

(1.7) P{X G A,X t G B} > it{A}(it{B} - e). 

Again - applying this bound requires that one of the sets A or B is large (of measure 
at least e). Moreover, in many examples the time t when the total variation distance 
is at most 1/e is much larger than the 1-logSob constant C . Therefore if t is of order 
C, then the mixing time bound (jl.7j) gives nothing while our result (II. 5p gives an 
efficient lower bound. 

In Section we demonstrate this point by proving new mixing bounds from large sets for 
various classical Markov chains including: 

• Random walks on general product spaces. 

• Glauber dynamics on Ising model on finite boxes. 

• The random transposition card shuffle on the symmetric group. 

• The top to random card shuffle on the symmetric group. 

• A natural random walk on the spanning trees of a graph considered in [JS02J. 

• The Bernoulli-Laplace model, i.e., the natural random walk on subsets of size r of a 
ground set [n]. 

• A natural Markovian queueing process - the q/q/oo Markov process. The last example 
is interesting since it has infinite 2-logSob constant and an infinite mixing time. 

1.6. Application 2: A general quantitative Arrow theorem. Arrow's Impossibility 
Theorem |Arr50t IArr63| is a fundamental result in social choice theory. Let us consider a 
situation where we have n voters and k candidates. Arrow considered functions F : S2 — > 

{ — 1, l}(a) that aggregate individual rankings (elements of the permutation group Sk) to result 
in a preference between every pair of the k alternatives. Arrow showed that if the following 
desired properties hold simultaneously when k > 3: 

• Transitivity - F(a) induces a transitive ranking for all a G S2, 

• Unanimity - for every pair of alternatives a and b, if all voters rank a above b then F 
also ranks a above b, 

• Independence of irrelevant alternatives (HA) - for every pair of alternatives, the result- 
ing outcome regarding the preference between a and b is determined by the individual 
preferences between a and b, 

then F is a dictator function, i.e., it is determined by a single voter. 

It is natural to ask how robust is the result when considering natural distributions over 
ST. This question was analyzed by Kalai [KalQ2j who studied it for the case of the uniform 
distribution over S% and showed that for every e > 0, there exists a 5 > such that if F 
satisfies: 
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• 5- Transitivity: ¥{F(a) is transitive} > 1 — S, 

• Fairness: for every pair of alternatives a and b, ¥{F ranks a above b} = 1/2. 

• IIA, 

then there exists a dictator function G such that ¥{F(a) ^ G(o~)} < e. 

Following a challenge by Kalai, Mossel |Mosl2] proved a stronger result for any number of 
alternatives and without the assumption that F is fair. His result shows that for k > 3 and 
every e > 0, there exists a 5 > such that if F satisfies 

• S- Transitivity: ¥{F(a) is transitive} > 1 — 5, 

• IIA, 

then there exists a function G, which is transitive and satisfies the IIA property, such that 
¥{F(a) ^ G(a)} < e. A complete characterization of all functions G that are IIA and 
transitive is given by Wilson |Wil72j - these functions include dictators, functions taking two 
values etc. 

A key ingredient of the proof in [Mosl2j is the use of reverse hypercontractive inequalities. 
It is further noted in |Mosl2| that it should be possible to extend the proof to general product 
distributions on S% given appropriate reverse hypercontractive bounds for general two point 
spaces. Here we prove the following extension: 

Theorem 1.16 (Quantitative Arrow's theorem for general distribution). Let g be general 
distribution on Sf. with g assigning positive probability to each element of Sfc. Let P denote 
the distribution g® n on S2- Then for any number of alternatives k > 3 and e > 0, there exists 

5 = 6(e, p) > 0, such that for every n, if F : S2 — > { — 1, l}^ 2 ) satisfies 

• IIA and 

• P{i ? (cr) is transitive} > 1 — 5. 

Then there exists a function G which is transitive and satisfies the IIA property and ¥{F(a) ^ 
G(a)}<e 

We note that considering general product distributions gives a more realistic model of 
actual voting (though the independence assumption in this line of work is still problematic in 
real voting scenarios). 

1.7. Application 3: Non-interactive correlation distillation from dice source. The 

problem of non-interactive correlation distillation deals with players who receive correlated 
random strings and whose collective goal is to agree with the highest possible probability on 
a random variable with a given distribution. Suppose there are k > 2 players and a 'cosmic 
source'. Assume first that the source generates a string x of n i.i.d. bits. Each player gets 
to receive an independent noisy copy of x. Each player then produces a single random bit 
based on her input. The players wish to have unanimous agreement on their outputs but are 
not allowed to communicate. The problem is to understand to what extent the players can 
successfully 'distill' the correlations in their strings into a shared random bit. 

This problem has been considered in |AMW91[ IMO05[ IYan04| lMOR+06) when the input 



strings consist of i.i.d. fair coin flips. Here we consider a more general version of the problem 
where each bit is an outcome of a throw of a fair dice with m-faces, m > 2. Let us introduce 
some notations. Let 0, = {1, 2, . . . , m} denote the set of the possible outcomes of a dice. Let 
x = (xi,X2, • • • ,x n ) be a random vector consisting of n i.i.d. random variables, each being 
uniformly distributed over O. Fix p G [0,1). Let y be a p-correlated copy of x (that is, for 
each j < n independently, with probability p, yj = Xj, and with probability (1 — p), yj = x'j, 
x' being an independent copy of x) and let (y t )\<i<k be conditionally independent copies of 
y given x. Let player i use the function Fi : £l n -f 11 to produce her output Fi(y l ). The 
functions Fi are all assumed to be balanced, that is, ¥{Fi(x) = j} = m^ 1 for all i, j. 



REVERSE HYPERCONTRACTIVITY 



9 



Define 



M p (k,n) = sup P{ all players output the same bit}, 

(^i)l<i<fe 



where the supremum is taken over all choices of balanced functions (i*i)i<i<fc. Since a balanced 
function defined on n variables can be also thought of a balanced function of (n+ 1) variables, 
for a fixed k and p, A4 p (k,n) is a non-decreasing function of n and so, lim n ^ ^ M p (k,n) 



exists. One of the main results of [MOR + 06 says that when m = 2, we have 



lim A4p(k,n) = k as k — > oo. 

n— >oo 

The upper bound of the above result uses an application of reverse hypercontractivity for 
simple semigroup on symmetric two-point space. Here we generalize this bound and give an 
inverse polynomial bounds (in k) on the agreement probability for general m. 

Theorem 1.17. Fix p G (0, 1). Then there exist positive constants 71 = 7i(/?),72 = 72(p)> c i = 
Ci(m, p) and 02 = 02(111, p) such that for all k>2, 



c 2 k 72 < lim M p (k,n) < c\k' 



■71 



Acknowledgements. The first named author enjoyed the hospitality of Isaac Newton Institute, 
Cambridge while completing part of this research. The second named author enjoyed hospi- 
tality of University of California, Berkeley and Isaac Newton Institute, Cambridge while doing 
this research. We thank Dominique Bakry, Franck Barthe, Nick Crawford, Michel Ledoux and 
Cyril Roberto for helpful comments and discussions. 

2. Comparison of Dirichlet forms 

The following theorem extends the classical Stroock-Varopoulos inequality [Str84, Var85j 
(covering the case p = 2, q G (1,2] of the present result). Theorem 12.11 is the main tool in 
proving Theorem 1 1.81 Note that some terms in the statement below may take negative values. 

Theorem 2.1. Let p,q G (0, 2] \ {1} and p> q. Then 

qq'£{g l li,g l /<i')>pp'£{g l lv,g l lv') 

for every g G H(p )00 y 

Remark 2.2. The above result has a natural extension to the case p = 1 or q = 1, with 
£(log g,g) replacing the right (resp. left) hand side of the asserted inequality; then it suffices 
to use functions ipi{x) = logx and (f2( x ) = x ( or V ; i( x ) = logx and ip2(x) = x, respectively) 
in the proof. 

The proof of the theorem will use the following lemmas. 

Lemma 2.3. Let L be a non-empty convex subset o/M. Assume that some functions (pi, ^2,^1,^2 
I — > R satisfy 

(y>i(a) - V 9 1 (6))( V 9 2 (a) - p 2 (&)) < (^i(a) - ^(6))(V> 2 (a) - ^(6)) 
for all a,b G I. Then for every f : f2 — > I there is 

where by <pi(f) we denote ip\ o f G TL, etc. 
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Proof. By applying order preserving and linear T% to a pointwise inequality 

(Mf) - Mb))(Mf) - Mb)) < (Mf) - Mb))(Mf) - Mb)) 

we obtain the inequality 

Tt(Mf)Mf)) - Mbm(Mf)) - Mb)T t (Mf)) + Mb)Mb) < 
T t (Mf)Mf)) - Mb)T t (Mf)) - Mb)T t (Mf)) + Mb)Mb) 

which also holds pointwise, so that it yields yet another pointwise inequality: 

T t (Mf)Mf)) - Mf)Tt(Mf)) - Mf)Tt(Mf)) + Mf)Mf) < 
T t (Mf)Mf)) - Mf)T t (Mf)) - Mf)T t (Mf)) + Mf)Mf)- 

By averaging both sides of it and using the fact that Ti is symmetric and mean-preserving we 
arrive at 

(2.i) EMf)Mf) - ^Mf)Tt(Mf)) < ^Mf)Mf) - ^MmiMf))- 

Let a : [0, oo) — > R be given by 

a(t) = E^(/)T t (</> 2 (/)) - E^(/)T t (^ 2 (/)). 
By (|2.ip the function a has a local minimum at zero, so that 

s (MflMf)) -s (Mf),Mf)) = «V(o) > o. 

□ 

Lemma 2.4. Assume that I is a convex non-empty subset o/R and functions ifi, tf2, ipi, 
tp2 ■ I — > R are differentiate and such that 

M°)Mb) + Mb)Ma) < Ma)Mb) + Mb)M*) 
for all a,b £ I. Then for every f : Q, —} I there is 

s(Mf),Mf))<B(Mf),Mf))- 

Proof By Lemma 12,31 it suffices to check whether $ : I X I — >• R given by 

$(a,6) = (Ma) ~ Mb))(Ma) ~ Mb)) ~ fai(a) - Mb))(M^) ~ Mb)) 
is nonnegative. Clearly, $>(x,x) = and j$-(x,x) = for all x & I. Now it is enough to 
notice that the assumptions of Lemma 12,41 yield JjJ^ > which implies that <!>(•, x) is 
non-decreasing on [x, oo) Pi J and non-increasing on (— oo, x] I. □ 

We are now ready to prove Theorem 12.11 

Proof. It suffices to use Lemma [2.41 with I = (0, oo), fi(x) = px 1 l' p 1 f2( x ) — p'x x l p \ ^pi(x) = 
qx l / q , and ip2(x) = q'x l l q ' . Indeed, to verify the assumptions of Lemma 12.41 one needs to 
check whether 

(a/b) l/p + {a/b)- l ' p < {a/bfl q + (a/b)- 1 '* 
for all a, b > 0. This is, however, obvious since the function kihs" 1 ! s~ w is even and convex 
for every s > and thus it is non-decreasing on (0, oo). Choosing s = a/b and recalling that 
< 1/p < 1/q ends the proof. □ 

We further obtain the following. 

Corollary 2.5. For any f € ^(0,00) there is 

£(log /, log /)<-£(/, 1//). 

Proof. It follows immediately from Lemma 12.41 applied to / = (0, 00) with (fi(x) = logx, 
y>2(x) = logx, ipi(x) = x, and ip2(x) = —l/x. □ 
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3. Logarithmic Sobolev inequalities 

In this section we prove various properties of log-Sobolev inequalities and in particular 
Theorem 11.81 We begin with a simple claim relating 0-logSob to the Poincare inequality. We 
suspect that both Lemma 13.11 and Lemma 13.21 below were previously known in the literature 
but we did not find any explicit reference. 

Lemma 3.1. 0-logSob holds with constant C if and only if the standard Poincare inequality 
holds with constant C/2, i.e., 

Varfo) < j£{g t g) 

for every g E H. 

Proof. For g E Ti and 5 > set / = e s ® . Assuming that / satisfies 0-logSob with constant C 
we obtain 

Var(^)<-^£(Ae-^). 
By the homogeneity of variance and bilinearity of E we get 

Va r(9 >4<^,I^)^§ £ ( 9 , 9 >. 

On the other hand, let / E H be positive and assume that the Poincare inequality holds with 
constant C/2. By using it for g = log / we arrive at 

Var(log/) < ~£ (log/, log/) < ~£(f,l/f), 
where we have used Corollary 12.51 □ 

The following easy observation allows us to restrict study of the p-logSob inequalities to 
the case p E [0, 2] (also, it reveals that 1-logSob is, in a sense, a replacement for ±oo-logSob). 

Lemma 3.2. If p E K \ {1} and p-logSob is satisfied with a constant C then also p'-logSob 
holds, with the same constant. 

Proof. For p = there is nothing to prove, whereas for p ^ it suffices to notice that by 
setting g = f p we obtain an equivalent 'self-dual' version of p-logSob: 

(3.1) Ent(g)<^£(g^,g^') 

for all positive g E H. □ 

We now prove Theorem 1 1 . 8 1 using the extension of the classical Stroock-Varopoulos inequal- 
ity proven in Theorem 12.11 



Proof. It is a direct consequence of the 'self-dual' reformulation (|3.ip of p-logSob, Theorem 12. 11 
and Remark 12.21 The fact that p-logSob with constant C implies 0-logSob (with the same 
constant) for every p / may be proved in two natural ways. One can deduce the Poincare 
inequality with constant C/2 from p-logSob by setting / = e^ 9 and letting 5 tend to zero 
(as in the first part of proof of Lemma 13. 1\ and use Lemma 13.11 to finish the argument). 
Alternatively, one can first deduce from p-logSob the q-logSob inequalities (with the same 
constant) for q arbitrarily close to zero, and then simply apply limit transition q — > 0. □ 

In view of Lemma [3. II and Theorem 11.81 the p-logSob inequalities, p E [0, 2], can be treated 
as a family interpolating in a continuous and monotone way between the classical Poincare 
and logaritmic Sobolev inequalities. Another approach to the interpolation problem may be 
found in [LO00J. Relation between the two approaches seems unclear to the present authors 
and perhaps it deserves some further investigation. 
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However, it is well known that all the p-logSob inequalties for p £ (1,2] are in a sense 
equivalent, at least if we do not care too much about constants (we do not know whether all 
p-logSob inequalities for p £ (0, 1) are equivalent in a similar sense). 

Proposition 3.3. Let 1 < q < p < 2. Assume that q-logSob holds true with a constant C > 0. 
Then also p-logSob holds true, with constant fezjCT C ■ 

Remark 3.4. It follows from [BGM94, Proposition 3.1] that in case of diffusions with invariant 
measure fi any g-logSob implies any p-logSob with the same constant. However, we did 
not find in the literature any reference to the results of the same form as Proposition 13.31 
regarding reversible Markov chains. On the other hand, one can first deduce from g-logSob a 
hypercontractive inequality and then deduce 2-logSob from it, which in turn yields p-logSob. 
This way around was known before but it yields much worse estimates. 

Proof. Indeed, since = $ it suffices to prove that £{g 1/q , g 1/q ') < £(g 1/p , g l,p ') for 

every positive g £ TL, which follows easily from Lemma 12.31 applied to / = (0,oo), (fi(x) = 
x l / p , f2(x) = x l /' p , ipi(x) = x 1 / 9 , and ip2(x) = x 1 ' 9 . The inequality 

(^i(o) - ^i(6))(^(o) - Mb)) < (Ma) ~ Mb))(Mo) ~ Mb)) 
is equivalent to 

(a/b)p~^ + (a/b)^~p < (a/b)?~^ + (a/b)*~9. 

Since for every s > the function w \— > s w + s~ w is non-decreasing on [0,oo), we finish the 
proof by setting s = a/b and noting that - — ^>| — ^>0. □ 

Usually it is not easy to prove the classical logarithmic Sobolev inequality (2-logSob in our 
notation). On the other hand, the following lemma provides a modified logarithmic Sobolev 
inequality (1-logSob in our notation) for a large class of simple semigroups. 

Lemma 3.5. Assume that the semigroup (Tf)t>o is generated by L = Id — E. Then it satisfies 
1-logSob with constant 4, i.e., 

Ent(f) <£(f, log f) 

for all positive f £ TL. 

Proof. Indeed, it suffices to note that the logarithm function is concave on (0, oo), so that 
Elog/ < logE/. Thus 

Ent(f) = Ef log f — Ef ■ log Ef < Ef log / - Ef ■ E log / = 
E/(log / - E log /) = EfL log / = £(f, log /). 

□ 

Remark 3.6. Lemma [3.51 was proved in |BT06] . We remark that the constant 4 is not always 

the optimal 1-logSob constant for the semigroups generated by L = Id — E. For example, for 

the two point space ({0, 1},o5q + (1 — a)5i) the best 1-logSob constant C is known [BT06] 

to satisfy C < , 4 < 4. For a = i, the best 1-logSob constant is 2 IBT06I. When 

— 1+2^/0(1-0) 2 ' b 1 ' 

a^, the best 1-logSob constant is not known (though the best 2-logSob constant is already 

known [DSC96]). It's not difficult to show that the best 1-logSob constant is strictly greater 

than 2 when a \. 
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3.1. Tensor ization. The p-logSob inequalities obviously share the tensorization property of 
the classical logarithmic Sobolev and Poincare inequalities. This is a standard observation 
but we include it here for reader's convenience. For i = 1, 2, . . . , n assume that (Jlj,/Xj) 
is a finite (this assumption may be relaxed) probability space with an associated space Hi 

of real functions on fij, and a Markov semigroup (T^)t>o : Hi — > Hi generated by a self- 
adjoint positive semi-definite operator Lj (all of them enjoying properties described in the 
Preliminaries section). Now let us consider a new semigroup {T t )t>o of operators acting on a 
space H = Hi X H2 X ■ ■ ■ X H n of real-valued functions on a product probability space 

($7, ju) = (fii x Q.2 x . . . x S7 n , hi <g) /x 2 • • • <8> Mn)- 

We obtain it by defining its generator L : % — > % as 

n 

L = Y^ Id Hi ® • • • ® ... i"d Wn . 

i=i 

Equivalently, we may define it by setting, for t > 0, 

T t = T t (1) ®T t (2) ®...®T t (n) . 

Proposition 3.7. Let p € R. In f/ie setting as above, assume that there exist positive 
constants C\, C2, • • • , C n swc/i i/ierf £/je semigroup (T t ^) t > satisfies p-logSob with constant 
Ci for i = 1, 2,..., ra. TTien f/ie semigroup (T t )t>o satisfies p-logSob with the constant 
C = max(Ci,C 2 , . . . ,C n ). 

Proof. We skip the proof, referring the reader to the classical tensorization argument: subad- 
ditivity of entropy (or variance, if p = 0). □ 



4. HYPERCONTRACTIVITY 

4.1. Control of moments under semigroup action. Let i = t(p) be a differentiable 
nonnegative function defined on a convex subset of R \ {0, 1}, and let / G H(p j00 y We will 
study behavior of the moments of functions /^ := T t ^f. An elementary though tedious 
standard calculation shows that 

(41) d Ent(f^)-pH'(p)S(f^, kp) ) 

(4-1) Tp log lir^/ii, . 

Since, as explained in the preliminaries, /^ is also strictly positive, we may apply to it the 
p-logSob inequality, which will yield monotonicity of the map p i-> ||Tt( p )/|| p upon appropriate 
choice of the function t(p). 

4.2. Hypercontractivity estimate. 

Proposition 4.1. Let r £ (1,2] and let (Tt)t>o be a symmetric Markov semigroup. 

Assume that (T t ) t >o satisfies r-logSob with constant C . Let r'<q<porl<q<p<r. 
Then for every t > ^ log |3[ and every f 6 H there is ||T t /|| p < \\f\\ q . In other words, T t is 
a linear contraction from L q (Q,u) to L p (f2,/i). 

Conversely, if there exists C > such that 

(4-2) ||Tc log£= i/|| p < H/ll, 

4 & q— 1 

/or a// p and q such that 1 < q < p < r, and for all positive f G H then (T t )t>o sastisfies 
r-logSob with the constant C . 
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Remark 4.2. That the concepts of logarithmic Sobolev inequality and hypercontractivity are 
intimately connected goes back to Gross |Gro75| . In fact, the converse part of the above propo- 
sition follows from Theorem 1.2 of |Gro75j though we add a short proof here for the sake of 
completeness. But the hypothesis of the forward direction (r-logSob implies hypecontractiv- 
ity) of our proposition is weaker than that of |Gro75] since |Gro75| assumes that r-logSob 
holds for a nonempty open interval - see Theorem 1.1 of [Gro75j for more details. We also 
comment that for r = 2 we recover the part (i) and (ii) of Theorem 3.5 of Diaconis and 
Saloff-Coste [DSC96J on the classical equivalence of the logarithmic Sobolev inequality and 
hypercontractivity for the reversible Markov chains (with essentially the same proof). 

Proof. In the proof of the first assertion without loss of generality we can assume that / > 
- indeed, since T± is order preserving, the pointwise inequality — 1/| < / < |/| implies that 
\Ttf\ < Tt\f \ pointwise, and thus ||Tj/|| p < ||It|/| || p whereas / and |/| have the same q- 
th norm. Furthermore, without loss of generality we may assume that / is strictly positive 
(which follows by considering functions / + e instead of / and then letting e — > + ). 

Theorem 11.81 and Lemma 13.21 imply that (Tt)t>o satisfies s-logSob with constant C for all 
s £ (l,r] U [r',oo). Let t(s) = f log^, so that t(q) = 0. Then s 2 t'(s) = -0^ and (H~T|) 
together with the s-logSob imply that the map s h-> ||Ti( s )/|| s in non-increasing on [q,p\. 
Comparing its values at the ends of the interval we arrive at ||7t /|| p < \\f\\ g for / > 0, 
where t Pi g = ^ log . To finish the proof of the first assertion for t > t Pt g it suffices to 
express Tt as Tt-t v q o T* , and use the fact that the semigroup is contractive in L p -norm 
(p>l). 

To prove the second assertion let us fix some q £ (1,^) and some positive / G H. For 
p S [q, r) let t(p) = ^l°g|Ei> so that Kq) = 0- Since the map p \-} \\T t ( p )f\\ p is non- 
increasing on [q, r) by using ([4.1|) at p = q we infer that g-logSob holds true with the constant 
C. Passing to the limit q — > r~ ends the proof. □ 

5. Reverse hypercontractivity - preliminary results 
In this section we prove some preliminary results regarding reverse hypercontractivity. 

5.1. Reverse contraction. We begin with the following standard fact showing 'reverse con- 
traction': 

Lemma 5.1. Let I be a non-empty convex open subset ofM. and let (?t)t>o be a symmetric 
Markov semigroup. Then for every t > and every concave $ : I — > M there is E$(TJ/) > 
E$(/) for all f £ H with values in I. In particular, for every q < 1 and positive f £ H we 
have \\T t f\\g > \\f\\ q . 

Proof. Indeed, $ may be expressed as infimum of a family C$ of affine functions: 

= inf-O(x); <p £ C$} 

for x £ I. Thus from the pointwise inequality <£(/) < </>(/) and positivity preserving by Tt we 
deduce TMf) < T t <f>(f) = <f>(T t f) for all <j> £ C$ and hence T t $(/) < inf{0(T t /); <fi £ C$} = 
$(T t f), pointwise, again. So E$(/) = ET t (*(/)) < E$(r t /), and we are done. The fact that 
also Ttf has values in / is a consequence of the order preservation. □ 

Note that the lemma used for I = M and <J?(x) = — \x\ p , p > 1 implies the contractivity of 
(T t ) t >o in L p -norm. 
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5.2. Duality and tensorization. The standard statement of the duality of L p -norms is that 
for p > 1 and / £ % we have 

\\f\\ p = sup{E/g; \\g\\ p , < 1}. 
A slightly less known observation can be found in |Bor82| : 
Lemma 5.2. Let p 6 (— oo, 1). T/ien /or any positive f £ 7i there is 

\\f\\ p = M{Efg;g>0,\\g\\ p > > 1}. 
We skip its proof since it is an easy exercise. 

The standard duality of the L p -norms implies that LP (f2, /x) is Banach space dual to 
L p (f2,/u) for any p > 1, and from the symmetry of the semigroup (Tt)t>o we deduce that 

\\ T t\\Lp(n,ti)^Li(n^) = \\Tt\\ Lq > (fl^^Lp' (n,fj.) 

for any p, q > 1 and i > 0. 

The case p,q £ (— oo, 1) is less standard and a bit more delicate (in particular, note that 
this is no longer the Banach space setting). We will need the following auxiliary result which 
was previously used by Borell [Bor82j. 

Proposition 5.3. Let p,q £ (— oo, 1) and t > 0. Assume that \\Ttf\\ q > for every 

positive f £ H. Then also ||Tt/|| p / > for every positive f £ H. 

Proof. Indeed, 

\\T t f\\ p , = M{EgT t f; g > 0, \\g\\ p > 1} = 
mi{EfT t g; g > 0, \\g\\ p > 1} > M{Efh; h > 0, \\h\\ q > 1} = ||/|| ? /, 
where we have used Lemma 15.21 the symmetry of Ti, assumptions of the proposition, and 
again Lemma 15.21 □ 

Lemma 5.4. Assume the set-up of Subsection \3.1\ Let — oo < q < p < 1. If for each 
1 < i < n, \\T^ f\\ q > H/llp for all positive functions f £ Hi, then ||7t/|| 9 > for all 

functions f £ ^(0,00)- 

Proof. The proof is an easy modification of the standard argument for showing the usual 
hypercontractive inequalities tensorize where Minkowski inequality is to be replaced by the 
reverse Minkowski inequality (Lemma 15. ip . We omit details. □ 

6. REVERSE HYPERCONTRACTIVITY - GENERAL RESULTS 

We establish an analogue of Proposition 14 . 1 1 for p and q below 1, extending results of Borell, 
|Bor82]. Now we restrict our considerations to positive functions. 

Proposition 6.1. Let r £ (0,1) and let (Tf)t>o be a symmetric Markov semigroup. 

Assume that (Tt)t>o satisfies r-logSob with some constant C > 0. Let r < q < p < r. Then 
for every t > ^ log and every positive f £ % there is ||Tt/|| 9 > 

Conversely, if there exists C > such that 

(6.1) l|Tc logi ^/|| g >||/|| p 

for all p and q such that < q < p < r, and for all positive f £ H then {Tt)t>o satisfies 
r-logSob with the constant C . 

Remark 6.2. Theorem 3.3 of Bakry's lecture note [BGM94] established similar equivalence 
between reverse hypercontractivity and r-logSob when r < 1. Indeed the converse part of 
Proposition 16.11 follows from that. But the forward direction, which turns out to be more 
useful in practice, Theorem 3.3 of [BGM94J assumes that r-logSob holds for all r belonging 
to some nonempty open interval instead of a single point. 
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Proof. Let us divide the proof of the first assertion into two basic cases: < q < p < r 
and r' < q < p < (and in fact we will need to prove only first of them since the second 
follows then by Proposition I5.3j> . Once they are proved, the assertion for < q < p < r and 
r' < q < p < will follow by passing to a limit (q — > + and p — > 0~ , respectively), while the 
case q < < p will follow from 



\\T t f\\ q = \\T t _c log ^(Tc log ^f)\\ q > ||T flog ^/|| > 

4 & 1— p 4 ta 1— p 4 to 1— p 

since t — ^ log jz^ > ^ log(l — q) for t>^ log jjj^ (we "glue" the two cases together at zero). 
Let us assume < q < p < r, then. Consider a function t(q) = ^ log defined on (0,p]. 

Then t(p) = and q 2 t'(q) = gffirn > so that °y @3D the map q i-> is non-increasing 

on (0,p] because r-logSob implies q-logSob, with the same constant C, by Theorem 11.81 At 
the right end of the interval the map takes on the value so that ||Tt p ,/||g > II /lip f° r 

t p , q = f log jE%. For t > t p>q we simply express T t f as T t „ tpA (T tp J) and use Lemma O 

To prove the converse assertion, let us fix some p E (0, r) and a positive f EH. For g E (0, p] 
let t(q) = ^ log jE^i so that i(p) = 0. Since the map q i— )■ is non-decreasing on (0,p] 

formula (|4.ip used at g = p yields p-logSob with the constant C. Passing to the limit p — > r~ 
ends the proof. □ 

We can now prove Theorem 11.111 In fact we will prove the following result which includes 
an inverse. 

Corollary 6.3. If a symmetric Markov semigroup {Tt)t>o satisfies r-logSob with constant C 
and r > 1 then for all q < p < 1 and every positive f E H for all t > Q log j^- we have 

Conversely, if for some C > a symmetric Markov semigroup (Tt)t>o satisfies \\Tc , i- g /|L > 

- 4 10 S l-p 

|| /||p for all < q < p < 1 anc? aZZ positive f £ H then it also satisfies 1-logSob with the con- 
stant C . 

Proof. By Proposition 11.81 for all r E (0, 1) also r-logSob holds, with the same constant C. 
The assertion follows immediately from Proposition 16.11 

The converse assertion is easy - Proposition 16.11 implies that (Tt)t>o satisfies r-logSob with 
the same constant C for all r E (0, 1), and it suffices to pass to the limit (r — > 1~). □ 



As in |Bor821 IMOR+ 06] we can now obtain the two function version corollary. 



Corollary 6.4. If a symmetric Markov semigroup {Tt)t>o satisfies 1-logSob with constant C 
then for all < p,q < 1 and every nonnegative f,g E H for all t > — § log[(l — p)(l — q)] we 
haveE[fT t g}>\\f\\ p \\g\\ q . 

Proof. Fix < p,q < 1 and t > ^log[(l — p)(l — q)]. Approximating the nonnegative 
functions / and g by positive functions / + e and g + e and then in the end letting e X 0, 
we can assume, without loss of generality, that the functions f,gEH are positive. Applying 
the reverse Holder's inequality (Lemma 15. 2p . we have E[fTtg] > ||/||p||Tt<7||p'- It remains to 
show that ||2t<7|L/ > \\g\\ q , which immediately follows from Corollary 16.31 once we note that 
(l-p') = (l-p)" 1 . ' □ 

We conclude this section by proving Corollary 11.121 



Proof. Indeed, Lemma 13.51 and Proposition 13.71 fin the product case) imply that (Tt)t>o sat- 
isfies 1-logSob with constant 4, so that it suffices to use Corollary 16.31 □ 
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7. Improved reverse bounds for simple semigroups 

Actually, we can significantly weaken the condition t > log in Corollary 11.121 for simple 
operators and prove Theorem 11.131 

Proof. It suffices to prove the claim in the case q < p < - Proposition 15.31 together with an 
observation that for < q < p < 1 there is p' < q' < and 

2 -p> _ (i_ g )(2-p) 
2-q> (l-p)(2-q) 

will do the rest. Also, we can restrict to the case L = Id — E - the product case will follow 
by Lemma [5-41 

Let q < p < and L = Id — E, then, so that 

T t f = e~ tL f = E/ + e-*(/ - E/) = E/ + 0(/ - E/), 

where (9 := e~* < 9 := §Ef G (0, 1). For s < 0, define V a : (-1, oo) -> [0, oo), 

T , , l + sx-(l + x) s 

t> s (x) = i '-. 

s 

It is easy to check that ^ s is a convex function with ^(O) = ^(0) = and ^"(x) = 
(1 — s)(l + x) s ~ 2 (actually, the same properties hold true in the case s > 0, and in the case 
s = with ^>o(x) = x — log(l + x), but we will not need those). The inequality 

(7.1) * q (9x) < %(x) 

holds true for every x £ (— 1, oo). Indeed, due to the properties of listed above it suffices 
to prove that 9 2 *&q(9x) < ^^(x). This is equivalent to the inequality 

(7 2) (!+£)!< /(2-g) 2 (i-p)\^ 



(l + 9x) ~ \(2-p)\l-q) 

which immediately follows from the elementary inequality (1 + x) e < 1 + Ox, and from the 
fact that the map 

s ^(2- s y 2 (i-s 



2-s 

is positive and non-decreasing on (— oo,0). We are to prove ||Ti/|| g > for every positive 
/ € H. By the homogeneity, we may and will assume that E/ = 1, so that g = f — 1 and 
g = 69~ 1 g are zero-mean and take values in (—1, oo). Then we have 

\\T t f\\l = ||1 + 9g% = ||1 + 6g% = (E(l + ^)T /9 = U - <zE* ? (^)) |p|/l<?l < 
1 _ ]p. q m q {9g) = 1 + HE*, flff < 1 + \p\E* p (g) = 1 + |p E* P (99^g) = 



i + \p\m p (oo- x g + (i - • o) < i + \ P \^{de-H p {g) + (i - ^- 1 )* p (o)) 



< 1 + |p|E* p ( 5 ) = 1 - pE* p ( 5 ) = E(l + gf = EfP = 

which ends the proof - recall that the exponent p is negative. The first inequality above was 
just an application of the elementary (1 + x) a < 1 + ax, with a = \p\/\q\ € (0,1), x > —1. 
The case p = follows by an obvious limit transition. □ 
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Remark 7.1. Note that we could obtain a better reverse hypercontractivity constant than 
those given in Theorem 1 1.1 31 by first maximizing the function fy q (9x)/*$> p (x) over x G ( — l,oo) 
and then by trying to solve for 9 in terms of p and q such that (|7,ip holds. This would lead 
to an equation of the form 

i_0\-(p-«) n-p)0p , . ... . 

> where r = (2-p)/(2-g). 



1-rJ (l-q)r 2 +P 

But unfortunately, in general, 9 can not be recovered explicitly from the above equation. 
However, in the special case when — oo < q < p = 0, 9 can explicitly be solved as 

9 = l + q(2-q)- 1+2 /' 1 [i(l-q)]- 1 ^. 

Corollary 7.2. If a symmetric Markov semigroup (Tt)t>o has a simple generator L = Id — E 
or it is a tensor product of such simple semigroups, then for allO < p, q < 1 and all nonnegative 
f, g G %, we have 

E[fT t g] > \\f\\ P \\g\\ q , 

for all t > log jf- p) ^ q \ . 

J — 6 4(l-p)(l-g) 

Proof. Fix < p, q < 1 and t > log 4^ J^-^J^ ■ Approximating the nonnegative functions 
by postive functions if necessary, we can assume, without loss of generality, that the func- 
tions /, g G T~L are positive. Applying the reverse Holder's inequality (Lemma I5.2H . we have 
E[fT t g] > ||/||p||7t<7||p/. Now consider tift^ > such that t% = log ^f- = log 2(i~-p) an< ^ 

ti = log 2( 2 i_^) • Note that t > t\ + t^. Thus, using the semigroup property, we can write 
Tt = Tjj o Tt 2 o T t _^ 1+t2 y Therefore we conclude that 

\\T t g\\ P > > \\T t2 oT t _ {tl+t2) g\\ > ||T t _( tl+t2 )fli|| g > \\g\\ q , 

where we used Theorem 11.131 in the first and the second inequality and Lemma 15.11 in the 
third inequality. □ 

We now obtain the following corollary regarding p- correlation. 

Definition 7.3. Consider a product space (O, p) = (n?=i < S> ) f=i/ / i) where (fij, pi) are finite 
probability spaces. We say that (x, y) £ f2 2 are p-correlated if x is distributed according to 
p and the conditional distribution of y given x is given as follows: for each i independently, 
with probability p, yi = Xi and with probability 1 — p, yi is sampled independently from p{. 

Lemma 7.4. Let (£l,p) be the product probability space in Definition \7.SX Let A,B C Q be 
two sets such that p{A},p{B} > e > 0. Let x be distributed according to the product measure 
p and y be a p-correlated copy of x for some < p < 1. Then 

■2-s/p 

F{x G A,y G B} > e 1 -^ . 
Proof. Let / and g be the characteristic functions of the sets A and B respectively. Note that 

¥{x eA,yeB} = E[f(x)g(y)} =E[f(x)E[g(y)\x]) = E[fT t g), 
where t = log(l/p) and T t = ®? =1 Tf where Tf = e -*( H - E ). So, by Corollary EH we have 
(7.3) E[fT t g] > \\f\\ p \\g\\ q , 

for all < p,q < 1 such that p = A ^-p)(2-q) • We now take P = 9 = in D to 

conclude the proof. □ 
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Remark 7.5. We can also use Corollary 16.41 which deals with general symmetric Markov 

2 

semigroups to get a lower bound e 1- ^. But this bound is worse than what we have achieved 
by using Corollary 17.21 that improves on the bounds provided by Corollary 16.41 in the case of 
simple semigroups. 

Remark 7.6. The lower bound in Lemma 17.41 does not depend on the marginal measures 
(J>i, 1 < i < n. Obviously, it is cannot be improved for p = 0. However, it is also quite close to 
optimal for p close to 1. To see it, consider 0^ = { — 1, 1} and pi = \o~-i + for all Vs. Let 
c = c(e) be such that ^vr)" 1 / 2 I 2 du = e, and let A = {z : n" 1 / 2 £" =1 z t < -c} and 

B = {z : n -1 / 2 Y27=l Zi — c i- Finally, let (G\, G2) be a Gaussian random vector with AA(0, 1) 
marginals, such that EfGiGy = p. By the Central Limit Theorem //{^4} and p{B} tend to e 
as n — > 00, while ¥{x 6 A,y £ B} tends to P{G*i < — c, G2 > c}. Now it suffices to note that 

log, P{Gi < -c, G 2 >c}> log, P{G 2 - Gt > 2c} ^ 

and 

2- y/p 1 < 2 < 2-,/p- 

1-y/P 2- 1- P - 1-y/p' 

which holds for all p G [0, 1). We skip some tedious but straightforward calculations. 



8. Reverse hypercontractivity for some non-simple operators 

For some of the applications afterwards we will be interested in operators that are not nec- 
essarily simple but are obtained by composing a simple operator with a non-simple operator. 
In this section we extend some of the reverse hypercontractive results to this setup. 

Proposition 8.1. Assume that p) is a finite probability space and K is Markov kernel on 
O. Let v = pK and 

(8.1) a:= min > 0. 

x,y.u{y}>0 V{y\ 

Let a* = — log(l — a). Let the operator K® n be the n-fold tensor product of the kernel K on 
the product space (Q n , p® n ). Then for all f : Sl n — > R + , for all q < p < and a* > log 

and also for all < q < p < 1 and a* > log we have \\K® n fWiq^n} > \\f\\LP(v® n )- 

Proof. Some readers may find it convenient and natural to interpret the abstract operations 
of this proof in terms of matrix multiplication - then measures and functions should be repre- 
sented, respectively, by horizontal and vertical vectors with coordinates indexed by elements 
of O, and Markov operators become stochastic matrices (for example should be understood 
as a matrix with all rows equal to the vector representing measure p) . First assume that n = 1 . 
Let T t = e~ l L + (1 - e~*)E M , t > be the simple Markov semigroup on (Q,p). We can extend 
the definition of Tt for t < as well, though it may no longer be a Markov operator. However, 
it is easy to check that T^tK = e t K — (e* — 1)E^ is a Markov operator for small enough t > 0. 
First, for all i, (T-tK)l = T^t(Kl) = T_tl = 1, so it remains to check the positivity of T_tK- 
Second, to ensure positivity, we need to show that e t K(x, y) — (e* — l)v(y) > for all x, y E CI 
which holds if < t < — log(l — a) = a*. 

Thus S := T_ Q * o K is Markovian and the kernel K can be written as composition of two 
Markov kernels in the following way: 

(8.2) K = T a *S. 
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For every probability measure p on we have pK^ = p, in particular for p = p, and therefore 
pT% = p for every real t. Hence (|8.2p implies 

pS = (pT a *)S = p(T a *S) = pK = v. 

Using the decomposition (18 . 2 j> . by Theorem II, 131 we obtain 

11*7112*00 = \\T a *(Sf)\\ LqM > \\Sf\\ LPM > \\f\\ LP(u) , 

for p and q as in the hypothesis. Since E^[/ p ] = (pS)f p = E,^[S(f p )], the last inequality 
follows from the pointwise (coordinatewise) inequalities (Sf) p > S(f p ) for p G (0, 1) and 
(Sf) p < S(f p ) for p < 0. They, in turn, follow from the Markovianity of S and concavity 
(resp. convexity) of the function (0,oo) B t i— )■ t p for p G (0, 1) (resp. p < 0). The proof for 
general n now follows from standard tensorization argument. □ 

Corollary 8.2. Consider the set-up of Proposition \8.1[ Then for all < p, q < 1 and all 

nonnegative /, g, we have 

nfK® n g] > ll/H 1 1<? 1 1 LP (f®™)) 

for all a* > log ggfSj) ■ 

Proof. Same as Corollary 17.21 □ 

Lemma 8.3. Lei (xj,yj)i<j< n 6e i.i.d. £l 2 -valued random variables. Let p and v be the 
marginal distributions of Xi and yi respectively and let K denote the conditional probability 
kernel K(a,b) = = b\xi = a}. Assume that a > where a is given in (|8.ip , so that 

P{xj = a,yi =6} > ap{a}u{b} for all a, b G O, and all i < n. Then for any two sets 
A, B C VL n such that p® n {A}, v® n {B} >e>0i»e have 

(8.3) P{i6AyeB}>e^^, 

where x = (x\,X2, • • • , x n ) and y = (j/i , 3/2, • • • , 2/n) fflre Vt n -valued random variables. 

Proof. Let / and 5 be the characteristic function of the sets A and B respectively. Note that 

¥{x G A,y G 5} = E[/(s) 5 (y)] = [/(x)E[5(y)|x]] = E„ 9 „ . 
Now by Corollary IQl 

E M ®« [/if®"^ > ll/lliPQ^njIlgllw^®")) 

for all < p, q < 1 such that 1 — a = ^2-p)(2-q) ' ^ e take P = Q = t° conc hide the 

proof. □ 

Remark 8.4. The following example shows that the condition a > cannot be dropped in 
general. Take O = {0, 1} and p to be the unbiased Bernoulli measure on fi. Let the kernel K 
be as follows: 

K=(° 1 
\l/2 1/2 

so that v = (l/4)<5 + (3/4)<Si. Now take A = {x\ = 0} and 5 = { yi = 0}. Then p® n {A} 
1/2 and ^® n {S} = 1/4 but F{x G A,y G B} = 0. 
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9. Mixing of Markov chains for big sets 

In this section we prove Theorem 1 1 . 1 5 1 which establishes mixing for Markov chains satisfying 
1-logSob. We then give a number of examples where the theorem can be applied to yield new 
results on mixing of Markov chains starting from big sets. We begin with a proof of the 
theorem: 

Proof of Theorem \1.15[ Take / and g to be the characteristic functions of A and B, respec- 
tively. Then by Corollary 16.41 for any choice of < p, q < 1 with (1 — p)(l — q) = e~ u l c , we 
get 

(9.1) ¥{X G A,X t e B} = E[fT t g] > \\f\\ p \\g\\ q = exp(-a 2 /2p) exp(-& 2 /2g). 

By setting p = 1+ e~2t/c(6/ a ) anc ^ 1 = i+t~2*/C(a/6) (^ n ^ s choice follows from a simple optimiza- 
tion) we conclude the proof. □ 

The above result says that as long as tt{A},it{B} > c and t > tC (where C is 1-logSob 
constant), we have a uniform lower bound on the hitting probabilities P^jXo € A,Xt G B} 
which only depends on c and r and is independent of the chain. As was discussed at the 
introduction the bound in the theorem is often stronger than expander mixing lemma bounds 
which require the sets to be large relative to e~ l l G where C is the 0-logSob constant. Further 
the bound is relevant at times t smaller than the mixing time since once mixing is reached 
within e we have ¥{X £ A,X t £ B} > ir{A}(ir{B} - e). 

We follow below with a list of examples where the new bound is of natural interest. We 
highlight examples where the mixing time is much larger than the 1-logSob constant since in 
these examples our bound is better than the mixing bound even if one of the sets A or B has 
large measure. 

9.1. Short walks in general hypercube. The first example is simply a random walk on a 
product space. Let (fl, fj,) be a finite probability space and n > 1. Consider the hypercube 
(f2 n ,/i® n ). The continuous-time random walk on this space corresponds to selecting one 
coordinate uniformly at random with each ring of a Poisson clock (with intensity 1) and 
updating that coordinate according to the distribution fj,. This is a reversible Markov chain 
with invariant distribution /i® n . When Q = {0,1} and /^{0} = fi{l} = 1/2, we have the 
standard continuous-time random walk on the hypercube. Let L = Id — E be the generator 
of the simple Markov semigroup on (O, //) and (Tt)t>o be the corresponding semigroup, then 
the generator of the random walk on the general hypercube is given by 

1 ™ 

L prod _ _ H q j d ^ . . . £ L g, . . . g, M 

n ^ v -v- / 

and the corresponding Markov semigroup can be expressed as 

lf od = T t/n ® T t/n (8) • • • ® T t/n , for t > 0. 
From Lemma |3.5[ Proposition 13.71 and Theorem II. 151 it follows that: 

Corollary 9.1. Let X% be the continuous-time random walk on the general hypercube (il n , jit® n ) 
with Xq distributed according to the product measure [i® n . Let a,b > 0. Then for any 
A,B QQ n with n® n {A] = e" a2 / 2 and fj,® n {B} = e~ h2 / 2 , and for t > rn, we have 

f la 2 + 2e- T / 2 ab + b 2 \ 



"{Xq eA,X t eB}> exp 



2 1 



In fact, a much better bound can be obtained by repeating the proof of Theorem 1 1 . 1 5 1 wit h 
the two function bound in Corollary 16.41 which applies to simple operators and their tensors: 
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Proposition 9.2. Let Xt be the continuous-time random walk on the general hypercube 
(£l n ,p® n ) with Xq distributed according to the product measure p® n . Let a,b > 0. Then 
for any A,BQ£l n with p® n {A} = e~ a ' ''I 2 and p® n {B} = e~ b2 / 2 , and for t > rn, we have 

mrv < v ( (2-e- T )(a 2 + b 2 ) + 2e~ T / 2 ab\ J+b*-*b ( (a + bf 
¥{X eA,X t eB}> exp -i A ^ > e J exp ' 



4(1 -e-r) I ~ * V 4(1 -e—) 

Proof. Note that the pair {Xq, Xt) is p-correlated in the sense of Definition [731 with p = e~*/ n . 

Let P = bS+(2- a p)a aIld 1 = a$H2-p)b > S ° that P'9 G ( ' 1 ) aild ^-riS-q) = P' ^ 

Corollary 17 21 applied to / = 1^ and g = 1b we have 

F{X e A,X t € B} = E[fTr d 9] > \\f\\ P \\g\\ q = exp(-^ - 

and the first inequality of the assertion follows easily. The second inequality in the assertion 
of the proposition is elementary □ 

Remark 9.3. The bound of Proposition 19.21 is quite tight, especially for small values of r, 
/j,® n {A}, and /j,® n {B}. Indeed, let us fix t = rn, choose a = a(a),{3 = j3(b) such that 

(27T)-V2 f a °° e-« 2 /2 du = g -a 2 /2 and ( 27r )-l/2 e ~^/2 du = g -6 2 /2 ) and then define two 

subsets of the discrete cube, A = {z : n~ l l 2 Ym=x Zi — ~ a } an( ^ B = {z : n -1 / 2 Y17=i Zi — ^i- 
Now it suffices to use the CLT as in Remark 17.61 (recall that in our setting p = e~ T ) and 
observe that 

(1 - e' T ) lnP{Gi < -a, G 2 > (3} < (1 - e~ T ) lnP{G 2 - G x > a + /3} -(a + /3) 2 /4 

while lim^oo a(a)/a = 1 and lim^oo /3(b)/b = 1. We skip tedious but quite standard 
calculations. 

We note that the mixing time of the above walk is of order nlogn. Therefore using the 
mixing time it is impossible to obtain effective bounds even when one of the sets A or B has a 
large measure and t is of order n (as noted before if both sets are of large measure depending 
on r, the expander mixing lemma applies). 

9.2. An example from queueing theory. In this subsection, we will give an example where 
we will show reverse hypercontractivity for Markov semigroup arising from a standard q/q/oo 
process (defined below). We will not use any knowledge about the p-logSob constants of the 
semigroup but establish reverse hypercontractivity by taking Poissonian limit of the reverse 
hypercontractive estimate for n-dimensional hypercube with product Bernoulli measure (j> = 
A/n). The example is of interest for a number of reasons: 

• It deals with a Markov chain defined on an infinite state space. 

• It is an example where the 2-logSob and the mixing time are both infinite, yet it is 
possible to obtain reverse hypercontractive and mixing estimates. 

• It is a natural example for queueing theory. 

Let p p be the Bernoulli measure (1 - p)So + p5i on {0, 1}. Let {x[ n) )t>o be the Markov 
process on state space {0, 1} corresponding to the simple semigroup generated by L — E w.r.t. 
the measure p p with p = A/n,A>0 fixed. Let X n,x , X n ' 2 , . . . , X n,n be i.i.d. copies of X^ n \ 
The process 

y(n) . = x n,l + x n,2 + _ _ _ + 

is again Markov (with state space N) whose 

generator ZA n ) satisfies 

i»/(x! +x 2 + ... + x n ) = {I- E^ /n )® n f( Xl ,x 2 , x n ), Xi E {0, 1}, 
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where / : {0, l} n — > R is given by the relation f(x±,X2, ■ ■ ■ ,x n ) = f{x\ + X2 + • • • + x n ). 
Clearly, v n = /^w n , the n-fold convolution of fi\/ n7 is the reversible measure of Y^ n \ A simple 
calculation yields 

L<»)/(k) = (1 - £)*(/(*) - f(k - 1)) + A( n _ k )(f(k) - f(k + 1)). 

So, as n — > oo, the sequence of generators converges to the generator L which is given by 

Lf(k) = -(k + X)f(k) - kf(k - 1) - Xf(k + 1), 

for all / : N — > R. One can easily recognize the above generator as the generator for the 
well-known q/q/oo process which we denote by (Yi)t>o- Thus (lt)t>o is a continuous time 
Markov process taking values in non-negative integers where Yf represents the number of 
customers in the queue at time t in the following set-up. There are infinite number of servers, 
the customers arrive according to a Poisson process with rate A and the service time of each 
customer follows an independent exponential with mean 1. This process is reversible w.r.t. 
lim n v n = Poisson(A). Since convergence of the generator implies the convergence of the 
process, we have, for each t > 0, 

(9.2) Yj n) A Y u when Y^ = Y . 

Here -4- means the convergence in distribution. 

Let (T^)t>o (resp. (Tt)t>o) be the semigroup corresponding to the Markov process (1^ )t>o 
(resp. (Y t ) t > ). 

By Theorem 11.131 for any bounded / : N — > R+, < q < p < 1 and n > 1 , 

\\T t (n) f\\ LH u n) > for t > log ^ZfpZ^y 



Letting n — > oo, by ([9.2p . we conclude that 

\\T t f\\ L « {Pom > ||/|| iP (Poi(A) ); for t > log ^I^lg - 

Similarly by approximating the process (Yt)t>o by the process y( n ) and applying Theo- 
rem [TTT5] we obtain that for any two sets A, B of integers of (Poisson A) measure e~°" 2 1 2 and 
g-k 2 / 2 respectively it holds that 



P{y G A, Y t € B} > exp 



1 a 2 + 2e- t ' 2 ab + b 2 



2 1 - e-* I 
If instead of Theorem 11.151 we use Proposition 19. 2( we obtain a stronger estimate 

(2-e- t )(a 2 + b 2 ) + 2e- t / 2 ab\ 



P{Y eA,Y t £B}> exp 



4(1 



Note again that this result holds in an example where the mixing time and 2-logSob constant 
are infinite (see [BL98J where it is shown that the 1-logSob is finite). 

9.3. Glauber dynamics on Ising model on finite boxes of Z d . The Ising model on a 
finite graph (V, E) has the state space f2 = { — 1, +1}^. The probability of a spin configuration 
cr E 0, is given by the Gibbs distribution, 



]— exp j p v(.u)a{v) +h^2 a ( u ) ) » 

' ' \ uv&E u&V / 
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where Z((3, h) is the normalizing constant. The parameters f3 > and h are called the inverse 
temperature and the external field respectively. These definitions extend to infinite locally 
finite graphs like Z rf . 

The Glauber dynamics for the Ising model is a family of continuous time Markov chains 
on the state space fi, reversible with respect to Gibbs distribution, given by the generator 

(Lf)(a) = ^2c(u,a)(f(a u )-f(a)), 

where a u is the configuration a with the spin at u flipped. We consider the two examples of 
transition rates c(u,a): 

(1) Metropolis: c(u, a) = exp (2ha(u) + 2(3a(u) Y. U vaE a ( u )) A L 

(2) Heat-bath: c{u, a) = [l + exp ( - 2ha(u) - 2f5a{u) Y, U v&E a ( u ))] 

Let A := [—n,n] d C % d be a finite box in the d-dimensional lattice. Let <9+A C A c be 
the vertex boundary of A in Let \x be the Gibbs distribution on Z rf . Given a boundary 
condition r 6 { — 1, we define a Gibbs distribution on A as a conditional measure: 

MA = K'Wd+A = t). 

Suppose that the inverse-temperature f3 and external field h are such that the Ising model 
on 7j d has strong spatial mixing. Then there exists a constant K, independent of n, such 
that given any boundary condition, the 2-logSob constant for the Glauber dynamics for the 
Ising model on the finite box A is bounded above by K, independent of n (see |MQ94a| 
M094bJ which succeed [SZ92b, SZ92a, Zeg92 where uniform bound for 2-logSob constant 



was established under stronger Dobrushin-Shlosman mixing conditions). It is also known 
that in the regime of strong spatial mixing, the mixing time of the Glauber dynamics is 
W = 0(k>gn). 

The example above can be easily extended to other spin systems and other graphs as long 
as 1-logSob inequality is established. 



9.4. Random transposition walk on symmetric group. The random transposition walk 
on the group S n of permutations of n elements is the walk generated by the set of all transpo- 
sitions C n = : 1 < i < j < n}. The Markov transition from any a £ S n is described by 
picking a transposition r uniformly at random from C n and compose it with a to get a new 
permutation tog G S n . It was shown in |GQ03 , [BT03| [Goe04| that the 1-logSob constant C 



of this chain is of order n. More precisely, 

n — 1 



< C < 2(n- 1). 

On the other hand, it is well known (see |DS81J) that the mixing time t m j x = G(nlogn). It's 
worth mentioning that the 2-logSob constant of the random transposition walk was determined 
in [LY98] to satisfy C = 9(nlogn). 



9.5. Top-to-random transposition walk on symmetric group. This is a random walk 
on S n generated by the set of transpositions T> n = ■ 2 < j < n}. Again the 1-logSob 

constant C of this chain satisfies |Goe04] 

!^<C<2(n-l), 
whereas the mixing time is £ m i x = ©(nlogn) (see [DFP92J). 
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9.6. Random walk on spanning trees. This is a natural random walk on the space of all 
spanning trees of a graph G = (V, E). Suppose T be our current spanning tree. We choose an 
edge e G E and another edge / £ T uniformly at random. If T' = T U {e} \ {/} is a spanning 
tree of G, we update T to T', otherwise we remain at T. It was shown in [JS02J that the 
2-logSob constant of this walk satisfies 

C < \V\\E\, 

and consequently, t m [ x = 0(\V\\E\ log |V|). In general, the upper bound for the mixing time 
is tight. For example, consider a line of length n and replace each edge by a double edge. 
Thus the new graph has \v\ = n + 1 and \E\ = In. The mixing time for the random walk on 
the spanning trees of this graph is same as the coupon collector problem with a delay of B (n) 
between successive moves. 

9.7. Bernoulli-Laplace model. This is natural random walk on the subsets of size r of the 
ground set {1,2,... , n}, 1 < r < n. So, the state space has size f"j. If the current state 
of Markov chain is an r-set A, we pick an element i uniformly at random from A and pick 
an element j uniformly at random from {1, 2, . . . , n} \ A and switch the elements to obtain a 
new r-set A' = A U {j} \ {i}. This is also known as simple exclusion process on the complete 
graph on n vertices. The 1-logSob constant of this chain satisfies |GQ03 IBT031 IGoe04] 



r(n — r) „ 2r(n — r) 

-1 L <C< — -. 

2n n 

The mixing time for Bernoulli-Laplace model is i m ; x = 0( log log ("))• 

10. A QUANTITATIVE ARROW THEOREM FOR GENERAL RANKING DISTRIBUTIONS 

Our goal in this section is to prove Theorem 11.161 We begin by briefly introducing some 
additional notation. Let A = {a, b, . . . , } be a set of k > 3 alternatives. A transitive preference 
over A is a ranking of the alternatives from top to bottom where ties are not allowed. Such 
a ranking naturally corresponds to a permutation a of the elements 1, . . . , k. The group of 
all rankings will be denoted by Sk- A constitution is a function F that associates to every 
n-tuple a = (cr(l), . . . , o~(n)) of transitive preferences, and every pair of alternatives a,b € A, 
a (strict) preference between a and b. Some key properties of constitutions include: 

• Transitivity. The constitution F is transitive if F(o~) is transitive for all a. In other 
words, for all a and for all three alternatives a, b and c, if F(o~) prefers a to b, and 
prefers b to c, it also prefers a to c. Thus F is transitive if and only if its image is a 
subset of the permutations on k elements. 

• Independence of Irrelevant Alternatives (HA). The constitution F satisfies the IIA 
property if for every pair of alternatives a and b, the social ranking of a vs b (higher 
or lower) depends only on their relative rankings by all voters. 

• Unanimity. The constitution F satisfies Unanimity if the social outcome ranks a 
above b whenever all individuals rank a above b. 

• The constitution F is a dictator on voter j, if F(a) = o~(j), for all a, or F(a) = a(j)^ 1 , 
for all a, where cr(j) _1 is the inverse of the permutation cr(j). 

We will assume each voter chooses one ranking from according to some fixed distribution 
g, independently of others. We will write P for the product measure g® n on and E for 
the corresponding expected value. Given two constitutions F, G on n voters, we denote the 
statistical distance between F and G by D(F, G), so that: 

D(F,G) :=F{F(a)^G(a)}. 



Our goal is to prove the following theorem which is a more explicit version of Theorem 11.161 
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Theorem 10.1 (Quantitative Arrow's theorem for general distribution). Let g be general 
distribution on Sk which assigns positive probability at least a to each element of Sk- For any 
number of alternatives k > 3 and e > sufficiently small, there exists 5 = 5(e) > 0, such that 
for every n, if F is a constitution on n voters and k alternatives satisfying: 

• II A and 

• P{i ? (cr) is transitive} > 1 — 5 

Then there exists a function G which is II A and transitive and such that D(F, G) < e. More- 
over, one can take 



10.1) 5 = exp 



Cq- 7 2 a " 2 (log(l/e)) 2 



for some suitable absolute constant C > 0. 

Our proof is an adaptation of the proof in |Mosl2j. We begin with some notation and 
definitions from [Mosl2]. 

Given a = (c(l), . . . , <?(ri)) 6 S2 and for each pair of alternatives a, b G A, we define binary 
vectors x a>b = x a>b (a) in the following manner: 

xCL>b (j) = 1] if voter j ranks a above b; 

and 

xCL>b (j) = — 1) if voter j ranks b above a. 
Thus, if F satisfies the IIA property then there exist functions f a>b for every pair of candidates 
a and b such that 

F(a) = ((f a>b (x a>b ):{a,b},a^beA) 

where f a>b : {-1, l} n -»■ {-1, 1} is such that f a>b = +1 if F ranks a over b and f a>b = -1 
otherwise. 

We define PX(/i, /2, /a) (PX stands for paradox) for three function /i, /2, : { — 1, l} n — >• 
[—1,1] by letting 

px(/!,/ 2 ,/ 3 ) = ){i+nh(x a>b )f2{x b>c )]+nf2{x b>c )h{x c>a )\ 



4 



+E[f 3 (x c > a )f 1 (x a>b )}). 



Note that for k = 3, the probability of non-transitive outcome is given by 
P(F) := P{(r >fe , f b>c , f c>a ) e {(1, 1, 1), (-1, -1, -1)}} 
= PX(f a > b J b > c J c>a ). 

In the rest of the subsection, we denote by a, the probability mass of smallest atom of 
the distributions of the random vectors (x a>b (l),x b>c (l),x c>a (l)) on { — 1, l} 3 for triplets of 
distinct alternatives a,b,c £ A. 

We now quickly discuss the notions of influences that are needed in the proof. For a 
function / : { — 1, l} n — > M where { — 1, l} n equipped with n-fold product of some biased 
measure fj, p = (1 — p)5-i + pS±, define the influence of variable i on / by 

InfiCf) := li® n {f(x l: . . . ...,x n )^ f{x x , . . . . . .,x n )}. 

i i 

Let {V'o = IjV'i} f° rm a basis of L 2 ({ — 1, 1}, /i p ). Then we can express / in its Fourier 
basis as follows: 

/(*)= ^ Rs)l[Mxi). 

SCW i£S 
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We define variance-influence of variable i on / as 

h(f) ■= £ f{sf. 

S:ieS 

When / is ±l-valued, it can be easily checked that the above two notions of influences are 
equivalent up to a multiplicative factor (independent of n) as follows: 

/,(/) < Inf i(/ ) < ^T^I*(/)- 
We also need the notion of low-degree variance-influences. For d > 0, this is defined as follows: 

s-.ies,\s\<d 

Under our assumption on the minimum atom of g, it's not difficult to show that for any 
three distinct alternatives a,b,c £ A and any voter i, we have 

\Corr(x a>b (i),x b>c (i))\ < 1 - 4a. 

The following lemma is a consequence of the reverse hypercontractivity in the biased space. 
It is the key ingredient needed to extend the argument of |Mosl2) to the nonuniform case. 

Lemma 10.2. Consider a social choice function on 3 candidates a,b and c and n voters 
denoted 1, . . . ,n. Assume that the social choice function satisfies that IIA condition and that 
voters vote independently according to g whose atoms are bounded below by a constant a > 0. 
Assume further that Infi(/ a>b ) > e and Inf2(/ 6>c ) > e. Let 

A = {a : 1 is pivotal for f a>b }, B = {a : 2 is pivotal for f b>c }. 

Then 

¥{AnB} > ewi-". 

Here voter j is called 'pivotal' for f a>b at a if f a>b is a non-constant function of the j th 
variable when we freeze the other (n — 1) variables at x a<b (a). 

Proof. Clearly, (x a>b (i), x b>c (i))i<i< n are i.i.d. with a joint distribution on { — 1, l} 2 deter- 
mined by g. Let ji and v be the marginal distributions of x a>b (i) and x b>c (i) respectively. 
Note that the event A is determined by x a>b and the event B is determined by x b>c and the 
their intersection probability is determined by the joint probability distribution of the random 
vectors x a<b and x b<c . Let Aq and Bq be the subsets of { — 1, l} n defined by: 

A = {x a<b G {-1, l} n : f a<b {x a<b ) £ f a<b (x a<b ei)}, 

and 

B = {x b<c E {-1, l} n : f b<c {x b<c ) £ f b<c (x b<c e 2 )}. 

Observe that ^ n {A } = Infi(/ a>b ) > e and v® n {B } = Inf 2 (/ 6>c ) > e, and our goal is 
to obtain a bound on P{Ao n Bq}. But now we are exactly in the set-up of Lemma 18.31 If 
K denotes the conditional distribution of x b<c (i) given x a<b (i), then we have the following 
lower bound on 

K(u,v) . P{x a<b (i) = u,x b<c (i) = v} 
mm — — = mm — — — > a. 

u,ve{-i,i} v\v\ u,t>e{-i,i} h L \ u ) l '\ v ) 

The proof now follows from Lemma [8.31 □ 

The reminder of the proof is a straightforward (though somewhat tedious) generalization 
of the proof given in [Mosl2j that does not use reverse hypercontractivity. A sketch of the 
modifications needed is given in Appendix lAl 
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11. NON-INTERACTIVE CORRELATION DISTILLATION FOR DICE 



The proof of Theorem 11.171 is a generalization of the proof given in |MOR+06| . The proof 
of the upper bound uses reverse hypercontractivity while the lower bound is based on the 
analysis of a simple protocol that is based on the plurality function and the analysis relies the 
on normal approximation. Here we give the proof of the upper bound while the proof of the 
lower bound is given in Appendix iBl 

Proof of upper bound of Theorem 11.171 Note that the probability of all players output j £ O 
is 



(11.1) 



E 



Y[F{Fi(y) = j\x} 



i=l 



where y is a p-correlated copy of x. Let fij(x) := \{p.^ = jy Thus if t = log(l//?) and Tf is 
the simple semigroup on f2 with the uniform measure, then we have 



fiTTTl) = E 



n E ^-(y)|; 



.4 = 1 



E 



.4 = 1 



i=i 



where the last step follows from the Holder's inequality. Since E/j,- = m _1 for all i,j, we 
conclude, by Lemma 111.11 below, that the probability of total agreement among k players is 
bounded above by X^'6nlli=i II^/mIU — Cmk~^ x for some 71 > depending on p. □ 

Lemma 11.1. Let Q = {1, 2, . . . , m} and p be the uniform measure on fi. Fix any p 6 (0, 1]. 

Then there exist constants C = C(p) > 0, f3 = f3(p) > such that for any f : Q n —> [0, 1] and 
for any k > 1 such that E/ < 1/2, 

im/ut < ck-p. 

Proof of Lemma[TTj\ Suppose \\T t f\\ k k > 25. Define S = {x £ £l n :[T t f(x)] k > 5}. Since T t f 
is bounded between and 1, it follows that K[lg] > 5. If we write / for 1 — /, then the set S 
has the following equivalent description 

5 = {xG{0,l}":r t /»<l-5 1 / fc }. 

Thus clearly we have 

(11.2) nisTj] < (1 - 6V k )F{S}. 

On the other hand, Corollary 17.21 gives us that 

4(l-p)(l-g) 



nisTti] > 111 



Slip 



If we take p = q = ^-^p^ m the above inequality, we have 



for any < p, q < 1 satisfying p < 



(2-p)(2-q) 



E[lsT t /] > F{S} 2 ^-^ (E/) 2 ' 1 -^) . 



(11.3) 

where we have used the fact that E[/ 9 ] > E/ for any g £ (0, 1). Now, comparing (111, 2ft and 
(jll.3p . we have 

F{S} 2 ^^ (E/) 2 ' 1 -^) < 1 - 5 1/k . 
Since, ¥{S} > 5 and Ef > 1/2, we have 



Vp 2 ~Vp . 

$W=7p)2 2 ^~Vp) < 1 - 8 l / k 



which implies that 5 < k P for any (3 < 2<yl and k sufficiently large. 



□ 
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Remark 11.2. It is an interesting problem to find the exact exponent 7 in Theorem 1 1 . 1 71 for 
which lim rt _>.oo M p (k, n) = k~ 1+ °^ as k — > 00. A priori such an exponent might depend on 
m. 

12. Observations and open problems 

Our main result on the monotonicity of r-logSob inequalities implies that the Poincare 
(0-logSob) inequality is the weakest among them. 

However several open problems regarding monotonicity: 
(I) Are there intervals / such that r-logSob inequalities are equivalent for all reversible 
Markov semigroups and all r £ I. In other words, for which intervals /, there exist 
constants c(I) such that for all r, r 1 G /, r-logSob with constant C implies r'-logSob 
with constant c(/)C? Note that Proposition 13.31 implies a positive answer to this 
question with the interval [1 + e, 2] for any e > 0. Note that this interval can not 
be extended to [1,2]. This follows for example from the fact that for the random 
transposition card shuffling on the symmetric group S n , 2-logSob constant is 0(n log n) 
|LY98j whereas 1-logSob constant is known to be O(n) |GQ03j [Goe04]. 
(II) Can one establish similar monotonicity property for hypercontractive inequalities? 

12.1. Reverse hypercontractivity implies spectral gap. Here we show that the Poincare 
inequality may be deduced from reverse hypercontractivity for fixed q < p < 1. This provides 
a partial answer to question (II) above. 

Lemma 12.1. Let q < p < 1 and t > 0. Assume that a symmetric Markov semigroup satisfies 
the hypercontractivity estimate ||Tt/|| g > for every f G %(o,i) • Then it also satisfies the 
Poincare inequality 

2t 

Var( 5 ) < - — r — T-£(g,g) 

log(l - q) - log(l -p) 

for every g €ri. 

Proof. Let A be the smallest nonzero eigenvalue of L. It implies that £{g,g) > AVar(<7) for all 
g belonging to l -1- , where l 1 - denotes the subspace of % consisting of all functions orthogonal 
(in the standard L 2 (0,£t) setting) to the constant function 1, i.e., zero-mean functions. For a 
zero-mean g G ri choose e > small enough to make / = 1 + eg > 0. The inequality 

(||TJ|| g -l)/ £ 2 >(||/|| p -l)/6 2 

upon passing to the limit e — > + yields E[ge~ 2tL g] = E[(T t g) 2 ] < j^E[g 2 ]. Since this bound 
holds for all 5 £ l 1 we infer that e~ 2tx < (1 —p)/(l — q) which ends the proof. □ 

12.2. Spectral gap does not imply 1-logSob. Here we show that the 0-logSob inequality 
does not imply the 1-logSob inequality. In particular it gives a partial answer to question (I) 
above by showing that the r-logSob inequalities in the interval [0, 1] are not all equivalent. 
Recall that a family graphs Q = {G±,G2, . . .} is called a d-regular (spectral) expander if 

(1) For each n, G n = (V n , E n ) is a d- regular graph on n vertices. 

(2) The random walk on G n satisfies a Poincare inequality with constant Co that does 
not depend on n. 

Assume, if possible, that there exists a constant C\ such that for each n, 1-logSob constants 
for the random walks on G n are bounded above by C\. Let {X^)t>o be the continuous-time 
random walk on G n . Since the underlying graph is d- regular, the stationary distribution tt is 
the uniform measure on G n . So, if we take A = {u} and B = {v} for u, v G V n in (|1.5p . then 
we have 

F{X'S = u, X[ l =v}> n~ a Vn, v G V n , 
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where a > is a constant that depends on C\. This implies that 
(12.1) F U {X? = v}> n~ a+1 Vn, v £ V n . 

By considering volume growth around a fixed point for a bounded degree graph, it is easy 
to convince ourselves that the diameter of the graph G n has to be at least cd log n for some 
constant c > 0. We choose u,v £ V n so that their graph distance is at least cdlogn. So, 
starting from u, a discrete time random walk on G n needs at least cd log n many jumps before 
it can reach the vertex v. Note that the number of jumps made by the continuous-time walk 
X" during the time interval [0, 1] is distributed according to Poisson random variable with 
mean 1. Hence, from the tail bound for the Poisson distribution, 

F{Xq = u, Xi = v} < P{Poisson(l) > cdlogn} < exp(— c'dlognloglogn), 

for some constant d > 0. Since the right hand side of the above inequality decays faster than 
any polynomial, it contradicts (112. ip . This proves that 1-logSob constant for the random walk 
on G n tends to infinity as n — > oo. 

Remark 12.2. Explicit lower bounds on 1-logSob constants for connected d-regular graphs on 
n vertices can be found in |Goe04, IBT06] , But our proof is different in the sense that it relies 
on the new mixing bounds implied by reverse hypercontractivity. 

12.3. Generalizations to infinite spaces. It is straightforward to generalize most of the 
result of Sections 1-9 of the paper to infinite probability spaces. The only point which requires 
some care is to work with the appropriate classes of functions. Since the applications in the 
current paper deal mainly with finite spaces we omit this straightforward extension. 
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Appendix A. Proof of Theorem 110.11 

We continue in the proof of the general quantitative Arrow theorem following [Mosl2j. 
The next step is to replace Theorem 7.1 and Theorem 11.11 in [Mo sl2j by the following 
two lemmas respectively. 

Lemma A.l. For every e > there exist 5(e) > and r(<5) > such that the following 
hold. Let /i,/2,/3 : { — 1,1}" —> { — 1,1} and let F be the social choice function defined by 
/<*>& = f u f>c = f 2 and joa = j 3 Assume that f or alll <i <3 and j > I it holds that 

(A.l) /f og(1/r)+1 (/ l )<ar/2. 
Then either 

(A.2) PX(/i,/ 2) / 3 ) >aS, 

or there exists a social choice function G which is either a dictator or always ranks one 
candidate at top/bottom such that D(F,G) < 9e. Moreover, one can take 

5 ^(e/^+i/^ 2 ), r = (5 c iog( r )ios r ) . 

Lemma A.2. For every e > 0, there exist 5(e) > and t(5) > such that the following hold. 
Let fi, fii fz '■ {—1, l} n — > [—1, 1]- Assume that for all 1 < % < 3 and all u £ {—1, 1} it holds 
that 



(A. 3) min(uE[/j], — uE[fi + i\) < 1 — 3e (with convention f^ = /i) 

and for all 1 < j < n it holds that 

(A.4) |{l<^<3:/f log{1/T))2 (/,)>r}|<l. 
Then we have 

PX(h,f 2 ,f 3 )>6. 

Moreover, one can take: 

5=1(^2)2+1/(2^ T = g C ^^\ 

8 

The proofs of the above two lemmas are almost identical to those given in [Mosl2j. The 
only difference is that instead of Theorem 11.10 of |Mosl2] we now use its modified version 
as follows. 
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Lemma A. 3. For every e > 0, there exist 5(e) > and t(5) > such that the following hold. 
Let fi, f2, f-s ■ {— 1, 1}™ —> [— 1, 1]< Assume that for all 1 < i < 3 and a// u € {—1, 1} /io/ds 
that 

(A. 5) min(uE[/j], — uE[/j+i]) < 1 — 3e (with convention f± = fi) 

and for all 1 < i < 3 and 1 < j < n it holds that 

lf s{lM (fi)<r, 

Then we have 

PX(h, f 2 , f 3 ) > 5. 

Moreover, one can take: 

4 

The proof of Lemma IA.3I depends on Gaussian Arrow's theorem (see Theorem 11.7 of 
|Mosl2j ) and the following generalization of some Gaussian invariance result proved in |Mosl2j 
(see Theorem 11.9). The latter may be of independent interest. 

Lemma A. 4 (Invariance principle). Let e > 0,-1 < p < 1. Then for every measurable 
function f : { — 1, l} n — > [—1, 1] there exists a measurable function f : M. n — > [—1, 1] such that 
the following holds for any n > 1. Let (X, Y) be distributed on { — 1, l} n x { — 1, l} n where 
(Xj,li)i<j< n are i.i.d. with Corr(A^,Y;) = p. Let 7 > be a lower bound for the smallest 
atoms of the random variables X; t and Y, L on { — 1, 1}. Consider (N,M) jointly Gaussian and 
distributed in M. n x M. n with (A^j,Mj)i<j< n are i.i.d. with 

E[Ni] = E[Mi] = 0, ELY?] = E[M?] = 1, E^M;] = P- 

Then 

• For the constant functions 1 and —1 it holds that 1 = 1 and —1 = —1. 

• If f and g are two functions such that for all 1 < i < n, it holds that 

max(/f° s ^(/),/f og ( 1 ^( 5 ))<r, 

then 

(A.6) \E[f(X)g(Y)} - E[f(N)g(M)] | < e, 

whenever 

r l°g(2/7) log(l/<0 

(A.7) r<e C TT^T « , 

for some absolute constant C > 0. 



Proof of Lemma A.J\ The proof is same as Theorem 11.9 of |Mosl2| . The only difference is 



that we now need to apply the version of Theorem 3.20 in [MOO10J under hypothesis H3 
instead of hypothesis H4. □ 

Proof of Theorem \10.1\ We will only give a brief sketch the proof of theorem for k = 3. The 
proof for k > 3 follows from a general argument given in |Mosl2j. 

Cloc(2/a) log(1/ '^ o) 

Take r = r(e) = 6 aS ° ,S = l(e/2) 2+1 ^ 2a ^ as in Lemma Q and r? = or/2. 

Let f a>b , f b>c , f c>a '■ { — 1, l} n — > {— 1, 1} be the three pairwise preference functions. Let 
r] = 5 (where the values of C will be determined later). We will consider three cases: 

• There exist two voters i ^ j £ [n] and two functions / {f a>b , f b>c , f c>a } such 

that 

(A.8) /? los(lA))2 (/)>,, lf°^ ))2 (9) > v 
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• For every two functions / 7^ g G {f a>b , f b>c , f c>a } and every i £ [re], it holds that 

(A.9) min^^ 1 ^)^/),/^ 10 ^ 1 ^) 2 ^)) < „. 

• There exists a voter f such that for all j 7^ j' 

(A.10) max(/j <(l°g(l/-)) 2 (/ a> fe)) ^^ < ^ 

First note that each F satisfies at least one of the three conditions (fOj) . or (lATOj) . 

Thus it suffices to prove the theorem for each of the three cases. 

In (IA,8j) . we have Infj(/) > rj and Inf,(g) > rj. By Lemma 110,21 combined with Barbera's 

Lemma |Bar80] (see Proposition 3.1 of [Mosl2| ). we obtian 

2-yr=s 4 
P(F) > ofr} 1 -^^ > a r\<*. 

We thus obtain that P(F) > 5 where 5 is given in (110. ip by taking large C. 

In case (1A.9|) . by Lemma I A. 2\ it follows that Either (if (IA.3[) does not hold) there exists 
a function G which always puts a candidate at top/bottom and D(F,G) < 3e, Or, P(F) > 

1^/2)2+1/(2^)^ 

Similarly in the remaining case (]A~T0j) . we have by Lemma [O that Either D(F,G) < 9e 
Or P(F) > i(e/2) 2+1 /( 2a2 ) > 5. The proof follows. □ 

Remark A. 5. Keller |Kelll| proved that one may take 5 = Ce 3 in the special case when g is 
uniform. It's an interesting open question to see whether such polynomial dependence of S 
on e holds for general distribution g. 

Appendix B. A lower bound for the NICD problem using a plurality function 

Proof of the lower bound for the NICD problem. We will analyze the protocol where all play- 
ers use some balanced plurality function PLU n that we are going to described below. De- 
fine rij = #{i : Xi = j} to be the number of times j is present in the string x and set 
R = { j £ f2 : nj = maxig^n;}. Then we define our pluraity function as 

PLU n (x) = x^ where = min{i : X{ E R}. 

Note that if j is the unique value in f2 which occurs most frequently in string x, that is, if 
R = {j}, then PLU n (x) = j. Also, note that if a is any permutation of f2 then 

PLU n (cr(xi), a(x 2 ), • • • , a(x n )) = <r(PLU n (a;)), 

which implies that PLU n is balanced. 

Define Wj = := E^lC^}-™"" 1 ) and W} = W>^ := n'^ £? =1 (l {w=J - } - 

m _1 ) where y is a p-corr elated of x. 

The probability of total agreement among k players is bounded below by the probability 
event that they all output 1 which is at least 



(B.l) E 



*{W{ > m&xW-lWj, l<j< m} k 



2 



Next we proceed to bound lim inf n P{W{ > maxj^i Wj| ^4} where A = {W% > |\/logfc and Wj < 
for all 2 < j < m}. Note that 

m 

¥{W[ > max W'AA] > ¥{W{ > Wlogk > \A} - VP{f' > Jlogk\A} 



J'=2 



F{W{ > y/fogklW! > l^f^k} - Y, ¥ i W j ^ < 0}. 

3=2 
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The last step is justified by the fact that Wj is a sufficient statistics for the conditional 
distribution of Wj given x. 
Note that for all 1 < j < m 

®l{xi=j} = ^{ 1 {y l =j} = m ~ l , Var(l {a , i=i} ) = Var(l te=j} ) = m _1 (l - mT l ) 

and for all 1 < j ^ f < m 



Cov(l {Xi=j} , l{ yi =j}) = pm 1 (l-m r ), Cov{l {Xi=j} , l{ Xi =f}) = -m~ 



(Wj,Wfi ^iV 2 (0,S) 



It now follows from multidimensional Central Limit Theorem that 



and 



(Wj,i<j <m)4iv m (o,r) 



3 ' 

as n — > oo, where iV 2 (0, £) (resp. iV m (0,r)) is the two-dimensional (resp. m-dimensional) 
normal distribution with mean zero and covariance matrix X (resp. T) given by 



T, = m - 1 (l-m- 1 ) (} P \ andT 



m 1 I m — m 2 11' 



Moreover, for any any convex regions Ri C R 2 and R 2 C R m , we have the Berry- Esseen- type 
error bound [S az81| as the following: 



(B.2) |P{(W^j) G R 1 }-¥{(Z 1 ,Z 2 ) G = 0( 

and 



n l ' 2 ) 



n 



-1/2^ 



(B.3) |P{(Wj, 1 < j < m) G i? 2 } - ¥{(Xj, 1 < J < m) G # 2 }| = 0( 

where (Zi,Z 2 ) ~ JV 2 (0 3 S) and (Xj,l < j < m) ~ jV m (0,r). From (lB~2l) . it follows that as 

n — > 00, 



and 

F{W^ > 0og&|Wj < 0} -> P{Z 2 > 0ogfc|Zi < 0}. 

Recall that the conditional distribution Z 2 given is N{pZ\, (T2.1) where 02.1 = (1 — 
p 2 )m~ l {l — m _1 ) < m _1 . Also recall that if N is a standard normal random variable, 
then 

F{N > x} < x^e~ x2 for x > 0, 
and this bound is sharp in the asymptotic sense 

p{iV > x} = e(a?~ 1 e~ a,2/2 ) as x -> 00. 

Now for any m > 2, 

p{z 2 > > g p V^ik} > F{^f i > -^IFK > 

> P {iV > -m0ogfc} > 1 - ^. 

Similarly, 

p{^ 2 > >/iogfc|zi<o} < i. 

Therefore, 

limPjW-f > maxMli} > 1 - I. 
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Consequently, for k > 2, we have 

]imM p (k,n) > (l - l) k liminf F{A\ 



> ^{Xt > ly/\ogk and Xj < for all 2 < j < m} [by (lB~3l) ]. 

The proof of the lower bound is now complete by Lemma IB. II □ 

Lemma B.l. Let {Xj, 1 < j < m) ~ N m {{),m~ 1 L m — m~ 2 l m l' m ). Fix d > 0. T/ien £/iere 
exists 72 = 72 (a!) > such that for k > 2, 



P{Xi > dy/logk and Xj < for all 2 < j < m} > c 2 {m)k~ 12 , 
where c 2 (m) — >• as m — > 0. 

Proof of Lemma \B.l\ Note that Xi + X 2 + . . . + X m = with probability one. Therefore, 

m 

P{Xi > dy/logk, Xj < Vj > 2} = Xj < -d^log k, Xj < Vj > 2} 

i=2 



(B.4) > P{X 2 < -d^logk, -1 <Xj<0 Vj > 3}. 

The conditional distribution of X 2 given Xj , j > 3 is given by 



m 1 



N -{m- l)m -1 J^-X"j,2m~ 2 (l 
Hence, it can be easily seen that 

m 

(lB~4l > ¥{X 2 + (m - l)m _1 J^Xj < -d^log fc - m}P{-l < Xj < 0, j > 3} 

J'=3 



> P{N < -dv^ogl-mjPl-l < Xj < 0,j > 3} 

where iV ~ N(0, 1). The lemma now follows from the normal tail estimate. □ 

Elchanan Mossel, Statistics and Computer Science, 367 Evans Hall, University of Califor- 
nia, Berkeley, CA, USA & Faculty of Mathematics and Computer Science, Weizmann Institute, 
Israel 

E-mail address: mossel@stat.berkeley.edu, elchanan.mossel@weizmann.ac.il 

Krzysztof Oleszkiewicz, Institute of Mathematics, University of Warsaw, ul. Banacha 2, 
02-097, Warsaw, Poland 

E-mail address: koles@mimuw.edu.pl 



Arnab Sen, Statistical Laboratory, DPMMS, Wilberforce Road, Cambridge, CB3 OWB, 
United Kingdom 

E-mail address: a.sen@statslab.cam.ac.uk 



